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I> : ABSTRACT 



We consider the problem of maximizing the sum of squares of the leading coefficients 
of polynomials Pi-^{x), . . . , Pi^{x) (where Pj{x) is a polynomial of degree j) under the 
restriction that the sup- norm of ^JLi -Pi^. (a^) is bounded on the interval (6 > 0). 

A complete solution of the problem is presented using duality theory of convex analysis 
and the theory of canonical moments. It turns out, that contrary to many other extremal 
problems the structure of the solution will depend heavily on the size of the interval [—6, b]. 



> . 

. 1. Introduction. Let IPj denote the set of all polynomials of degree j, / = {zi, . . . , im} 

CN ■ denote a subset of {1, . . . , n} containing n (i.e. n & I, ii ik if k I) and define 

\o ■ 

O 



Pi := \iPj)jei\Pjerj,jeI, sup Vp2(^)<i| 



■ as the set of all polynomials of degree ii,. . .,im such that the sup-norm of the sum of 
S ■ squares is bounded by 1 on the interval [—6, b]. In the following mi{Pi) denotes the leading 
coefficient of the polynomial Pi G IPi and we are interested in the nonlinear extremal 
^ \ problem 

{Vi) max{j]m2(P,) | (P,),^, G Pi] . 

lei 

For / = {n} (Vi) yields the well known extremal property of the Chebyshev polynomials 
of the ffist kind T„(f ) (see e.g. Natanson (1955), Achieser (1956) or Rivlin (1990)). 
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Similar problems were investigated by Dette (1994a) who considered the maximization of a 
weighted product of the squared leading coefficients of the polynomials Pi{x). All extremal 
problems in these references satisfy a so called "invariance property" which means that if 
a solution on one interval, say [—1, 1], has been determined, then the solution on another 
interval can easily be obtained by a linear transformation from the "optimal" polynomials 
on the interval [—1, 1]. 

In this note we will present a complete solution of the (nonlinear) extremal problem (Vi). 
It will turn out that the above invariance property is not true any longer for the problem 
(Vi) if m > 2. While for sufficently small 6 > the Chebyshev polynomial (of the first 
kind) on the interval [—b, b] of degree max™^]^ ij is a solution of [Vj) (all other polynomials 
are vanishing) this is not true any longer for large b. Here the structure of the extremal 
solution depends heavily on the size of the interval [—b, b] . 

In Section 2 the problem (Vi) is solved by an application of some results in convex analysis 
(see Pukelsheim (1993)) and the theory of canonical moments (see Studden (1981)). It 
turns out that the problem (Vj) is dual to a maximization problem of a concave function 
defined on the set of all probability measures on the interval [—6, b] . This problem appears 
in the theory of optimal experimental design in mathematical statistics (see Dette (1994)). 
While from a statistical point of view the support points and weigths of the optimal measure 
are the main interest it is shown in this paper that the orthogonal polynomials with respect 
to this measure form essentially the solution of the extremal problem (Vj). Section 3 deals 
with some special cases for the set /, namely / = {1, . . . , n} and I = {n — l,n} and some 
explicit examples. Finally, in Section 4, similar problems are investigated which generalize 
the extremal properties of the Chebyshev polynomials of the second kind. 

2. The Solution of (Vj). Throughout this paper ^ is a probability measure on the 
interval [—6, b] and the corresponding orthogonal polynomials with leading coefficient 1 
will be denoted by Pj{x,$,) and their (squared) L2-norm by kj{^) = J^f^Pj{x,$,)d^{x). 
The main step for solving the extremal problem {Vi) is the following duality which is 
proved in the appendix. 

Theorem 2.1. Let S := | kn{C) > 0} and n e I, then 
(Vi) max|^m^(PO | (P,),^/ e P/} = minmax {A;-i(0 \ j e 1} {Vj) 

l€l 

Moreover, if ^* is a solution of the problem ("Dj) and 
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then {y^aj/kj{^*)Pj{x, is a solution of (Vj) where Pj{x,^*) is the jth monic or- 

thogonal polynomial with respect to the measure d^*{x) and the aj are (arbitrary) nonneg- 
ative numbers with sum 1 satisfying 

(2.1) a, =0 tfjeI\M{C) 

(2.2) J]a,fc-i(r)if(x,r) < 1 forallxe[-b,b]. 



The dual problem (Vj) appears in the theory of optimal experimental design in mathe- 
matical statistics and has been solved in the special case / = {1, . . . , n} (see Dette (1994)). 
While statisticians are mainly interested into the support points and weights of the solution 
^* of {T>j) (they give essentially the points where observations have to be taken in a 
polynomial regression) Theorem 2.1 shows that the orthogonal polynomials with respect 
to the measure d^*{x) are needed for the solution of the primal problem (Vi). In order to 
determine these polynomials (and to solve the dual problem {T>j)) some basic facts about 
canonical moments of probability measures on the interval [—b, b] are needed. The Stieltjes 
transform with corresponding continued fraction expansions of such a measure ^ is given 

by 

^ dC{x) _ 1 I _ 26Ci I _ 26C2 I _ 26C3 I _ 26C4 I 
z — X \z-\-b I 1 \ z + b I 1 \ z + b 

1 I (2&)\iC2 I m%c4 I 



|2; + 6(l-2Ci) I 2; + 6(l-2C2-2C3) | 2; + 6(1 - 2(4 - 2(5) 

where Ci = Pi, Cj — Qj-iPj U > 2), g-,- = 1 — pj {j > 1) and < pj < 1 (see e.g. Lau and 
Studden (1988)). The quantities pj are called the canonical moments of ^. Note that pj+i 
is undefined whenever pj e {0, 1} because in this case the continued fraction terminates. 
It is well known that the polynomial in the denominator of the nth convergent is the nth 
monic orthogonal polynomial with respect to the measure d^{x) and that this polynomial 
is given by the continuant (see Perron, Bd. I, (1954), p. 9) 

(2.4) Pn{x,0 = 



^, -(26)2CiC2 -(26)2(3(4 ••■ -(2&)2C2n-3C2n-2 



x + &(l-2Ci) a; + 6(1-2(2-2(3) a: + 6(l-2(2,_2-2(2,_i; 

and has L2-norm 

/b n 

Pl{x,i)di{x) = (26)2- J] (2, _ 1(2,- 

-b j=i 
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(see Chihara (1978) or Wall (1948)). The following theorem determines the canonical 
moments of the solution of the dual problem (X>/). 



Theorem 2.2. The solution ^* of the dual problem (Vj) is uniquely determined by its 
canonical moments P2j-i = | (j = 1) • • • > ^)> P2n — 1 

n—l ^ 

(2-6) = max|^„_j- 1 - 6-2^ {q*2iPli)~^\, ^ j=n-l,...,l. 

i=n— j+l 

where z^-j is 1 or according to n — j G I or n — j ^ I. 

Proof. In the special case / = {1, . . . , n} a proof of Theorem 2.2 can be found in Dette 
(1994), which can be generalized to arbitrary index sets. For the sake of completeness 
we provide a different proof in this paper, which is directly based on the duality result 
of Theorem 2.1 and uses some identities for orthogonal polynomials on campact intervals. 
Let 7„_j = 1 - b-^^ nr=n-j+i iQ2iP2i)~^ {in = 1), then it is easy to see (observing (2.5) 
and (2.6)) that 7n-j > | if and only if kn{^*) = kn-j{^*) and 7n-j < | if and only if 
kniC) < kn-jiC) — 3 £ I)- Consequently we have for the set M{0 in Theorem 2.1 
and the canonical moments defined in (2.6) 

(2.7) n e M{n = {j e / I 7, > \] 

(2.8) pl^ = \ iijiMiC). 

In the following let Pi{x,^*) denote the /th monic orthogonal polynomial with respect to 
the measure d^*{x) and define 

(2.9) a, = n 4 fl - % 

fj, P*2i \ Plj 

which have sum 1 and are nonnegative, by the definition of p^j in (2.6). From Theorem 3.5 
and 4.1 in Dette (1994b) it follows that the orthonormal polynomials k'^^^'^{^*)Pi{x,^*) 
with respect to the measure d^*{x) satisfy 

n 

(2.10) Y.^3kj\c)p!{x,c) = E ^3kj\npK^^c) < i 

for all X G [—6,6]. Note that the result in Dette (1994b) was originally stated on the 
interval [—1,1] but can easily be transferred to the interval [—6, b] and that we have used 
aj > 0, aj = if j ^ M{C), which follows from (2.8) and (2.9). By (2.10) we thus have 



(2.11) {P;{x)hei iJk^/A^^n} e Pj 
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and using the definition of A4{^*) and Oij = 1 we obtain 

JL^m) = E ^"(PD = ftpt = ^^if'i'in I i e /} . 

Therefore we have equahty in Theorem 2.1 for (Pj^jj^/ e Pj and ^* e S and the assertion 
of the theorem foUows. ■ 

Remark 2.3. In the statistical theory the support points and the weights of the optimal 
probability measure (minimizing (X>/)) give the relative frequencies and locations of the 
observations in a polynomial regression. For the special index set / = {!,..., n} this 
measure has been determined explicitly in Dette (1994). 

Remark 2.4. The polynomial P* in the set {P*}j^i vanishes, whenever j ^ Ai{^*) 
(which follows from = \ and (2.9)), however, there might be situations where aj = 
also for some j G M.{^*). Observing the arguments at the end of the proof of the pre- 
ceeding theorem the solution of {Vi) is obtained from (2.11) where the monic polynomials 
(orthogonal with respect to the measure d^*{x)) are given by (2.4) and the quantities 
kj{^*) are obtained from (2.5). This provides a complete solution of the extremal problem 
(Vi). In the following we will discuss some special cases of the set / for which this solution 
becomes more transparent. 



3. Chebyshev polynomials of the first kind. If / = {n}, the solution of (Vi) is 
given by the Chebyshev polynomial of the first kind (on the interval [—6,6]) Tn{f) (see 
Rivlin (1990) or Natanson (1955)). In this Section we will discuss two other sets for which 
the extremal polynomials have a relative simple structure, namely / = {l,...,?i} and 
7 = {n — 1, n}. It turns out that the answer of the question if the Chebyshev polynomial 
of the first kind is also a solution of (Vi) for these sets will depend heavily on the length 
of the interval [—6,6]. We will start with the discussion of the problem (Vi) for the set 
/ = {1, 2, . . . , n}. In the following Un{x) denotes the Chebyshev polynomial of the second 
kind (on the interval [—1, 1]). 

Theorem 3.1. Let / = {1, . . . , n} and 

(3.1) k = minjj G {l,...,n} I t/2n-2i+i(^) > /or z = j, . . . , n| 

then the solution of the extremal problem (Vi) is given by the polynomials {-P;*(a;)}[Li 
where Pi*{x) = ifl<k-l, 



(3.2) P*{x) = A 



b 2 Un-kW b 2 
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(I = k, . . . ,n) and 



Pi 



± 



^t^2n-2/+l(|) 



C^n-fc+l(|) 

The maximum value of {Vi) is given by 



{l = k,...,n). 



Proof. For j = n,...,k define 7j(^*) = 1-6 ''^ Il7=j^+i(^2iP2i) ^ (here we put 
7n(C*) = 1 ^iid the cire defined by (2.6)), then it is straightforward to show that 



(3.3) 



t^2n-2j + l(|) _|_ 1 



j = k,...,n 



and the definition of k in (3.1) and Theorem 2.2 yield for the canonical moments of the 
solution ^* of the dual problem ("D/) p^j = {j = k, . . . ,n). If /c > 2, then it follows 

that 

n— 1 ^ 

-lie) = i-b-'^^^-'+'^iHq*,,?*,,)-' < ^ 



Ik- 



i=k 



and that b <2 which implies (by Theorem 2.2) P2fc-2 = I 7fe-2(C*) < 1 - 2b~^ < |. 
Therefore the canonical moments of the solution ^* of the dual problem (D/) in Theorem 
2.2 are given by 

1 1 * 1 * 1 1 * In 
2 ' 2 ' ' ' ' ' 2 ' 2 ' ^2A;+2' 2 ' ' ' ' ' 2 ' 2 ' 

where — IjiC*) U — . . . ,n) and 7j(^*) is defined in (3.3). By Theorem 2.1 we have 
to find the orthonormal polynomials with respect to the measure d$,*{x) whose monic form 
is given by (2.4) that is 



fc-i 



Piix,n = K 



= K 



&2 

"2" 
X X . 

fc-1 



62 



4 

X 



2 'P2k b Q2kP2k+2 

X X 



' 2 



X 



" 4 

X 



4 
a; 



-P2fe 

a; 



~b'^l2l-AP2l-2 
X X / 

\ 

-1 



X 



X 



X 



I 



(3.4) 



= 2"''+^6'= 



7^fe(^)t^^-fc(|) 



^n-fc+l(|) 
^n-fe(l) 



''Tfe_i(^)C/^_i_fe(|) 
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Here we have used Sylvester's identity (see e.g. Studden (1980), formula (4.12)), (3.3) (for 
j = k) and the recursive definition of the Chebyshev polynomials of the first and second 
kind. Note that the case k=l has to be considered separately but gives the corresponding 
result in (3.4) for A; = 1. The L2-norm of this polynomial is given by (note that = |, 
j = 1, . . . , /c - 1 and p^j. = 7j(r), j = k,...,n) 

1 ' h2k-l TT , (b\ 

k.in = 6-(i)'-V2. n "h-^ph = iiFji^p^ 

j=k+l n-feV2/ 

while the quantities ai in (2.9) are obtained as 

Un-lik)[Un-l+l{k) — Un-l-l{^)] t^2n-2«+l(|) 



ai 



Un-k{-2)Un-k+l{2) C^n-/s ( 2 )^n-fc+l ( 2 ) 

{I — k, . . . ,n). The assertion now follows from Theorem 2.1. ■ 

3.2 Discussion. Theorem 3.1 shows that the structure of the solution of (Vi) changes 
completely with the length of the interval [—b,b]. If 6 < -\/2, then we obtain from (3.1) 
k = n and consequently the sum of the squared leading coefficients of the polynomials 
Pi*, . . . , P* is maximized for the choice Pi* (x) = {1 < I < n-1) and P*{x) = Tn{f) with 
maximum value (2"~-^6~"')^. If 6 > y/2 the situation changes completely. In this case the 
index 1 < A; < n defined by (3.1) depends on n and b. The solution of the problem (Vi) is 
given by (3.2). Finally, if 6 > 2, it follows that k = 1 and (3.2) simplifies to 



,X _ Un+l{^) X, 

-2^ Un-iil) '"2' 



I — 1, ... ,n 



Example 3.3 Let n — 3, then we have to distinguish the following cases: 
A) If 6 < we have k = 3, the optimal polynomials are given by 



,x. 



P^{x) = PUx) = 0, Pi{x) = ±Ts{-) 
and the maximum is lQb~^. 

B) If ^/2 <b< y/S, then k = 2, the optimal polynomials are 



P^{x) = ± 



62 - 1 



- ,x^ b"^ - 



= ± 



1 



2(62 - 1) 



,x. 



7 



and the maximum value is 46 ^(6^ — 1) ^. 

C) If 6 > -s/S, then k = 1, the optimal polynomials are 



2' 62 



1 



6^-362 + l„ 



^3W = ±6^F=(2) ^3^^'(2) 

and the maximum value is (6^ — l)/[6^(6^ ~ 2)]. ■ 

In the remaining part of this section we will consider the index set I = {n — l,n}. Thus 
the problem is to maximize the sum of the squared coefficients 

(3.4) ml_,{Pn-i) + mliPn) 

over the set of all polynomials (of degree n — 1 and n) satisfying 

(3.5) Pn-iix) + Pl{x) < 1 foraU xe[-6,6]. 

The solution of this problem can be obtained by a similar reasoning as in Theorem 3.1 for 
k = n and k = n — 1 and we omit the details in the proof of the following result. 

Theorem 3.4. The polynomials P*_i{x) and P^ix) maximizing (3.4) subject to the re- 
striction (3.5) are given by 



{p:-i{x\p:{x)) = (0'±^n(^)) i/ 6< V2, 



{P:-i{x).P:{x)) = (±^||^T„_i(^), ±^(P^[^'^n(^)-(&'-2)T._2(^)]) 

if b > V2. The maximum values in (3.4) are given by 2"^"^ % if b < and by 
22n-4^-(2n-4)(^2 _ -^yi ^ > respectively. 

Remark 3.5. For index sets of the form Im = {n — m + 1, . . . , n} the corresponding 
results are obtained similar to Theorem 3.4. The values of b where the structure of the 
solution is changing, are obtained successively from (3.1) as 6 = v^, 6 = v^, 6 = ^/2 + -\/2, 
6 = (Vs + V5)/V2, . . . (see also Example 3.3). 



4. Chebyshev polynomials of the second kind. In this section we will briefly discuss 
some generalizations of the extremal properties of the Chebyshev polynomials of the second 
kind. Let / denote a subset of {0, 1, . . . , n} and define 

Pi ■■= {iPj)jei I Pj e IPjJ e /, sup (6^ - x^) ^P/(x) < l| 
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as the set of all polynomials {Pj)j^i such that a weighted sup- norm of the sum of squares 
is less or equal 1. We are interested in the problem 

(-Pi) max{ J] mf(PO | {Pi)iei G Pi] . 

lei 

If / = {n} we obtain the well known extremal proerty of the Chebyshev polynomials of 
the second kind Un{x), if 6 = 1, (see e.g. Achieser (1956), p. 250) and more generally 
of Un{ f)/b, if 6 > 0. For the sake of brevity we will only state the generalizations corre- 
sponding to the index sets / = {0, . . . , n} and / = {n — l,n}. All proofs can be obtained 
by a similar reasoning as in the previous sections and are therefore omitted. 

Theorem 4.1. Let / = {0, . . . , n} and 

(4.1) k = minjj e {0,...,n+ 1} I C/2n-2i+3(^) > /or z = j, . . . , n + l} 

then the solution of the problem (Vi) is given by the polynomials {P^ (a;)}|Lo where Pf (x) = 
0ifl<k-2 and 



(4.2) Pi%x) = Pi 
{I — k — 1, ... , n), where 



b 2 C/n-fc+i(§) b 2 



t^2n-2Z+l(|; 



= ^^7^77 ^ (Z = /c-l,...,n). 



The maximum value of (Vi) is given by 



Remark 4.2. If 6 < \/2 then it follows from (4.1) that k = n + 1 and the solution of 
(Vi) is given by the polynomials Pj^ix) ^ 0, I — 0, . . . ,n - 1, and P*{x) = lUnif). As 
in Discussion 3.2 it follows that for 6 > 2 we have k = 1 and the optimal polynomials 
are "essentially" independent of the interval [—6, b] and proportional to the Chebyshev 
polynomials of the second kind, that is 



V t^2n-2/+l(|) X 
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with maximum value In the interval [v^, 2] we have 1 < A; < n + 1 

(depending on b and n) and the solution of (Vi) is given by (4.1) and (4.2). 

Theorem 4.3. Let b < then the solution of the problem 



(4.3) 



maximize m^_i{Pn-i) + m^{Pn) 



subject to the restriction 



(4.4) 



sup (b' - x')[P^_,ix) + P^ix)] < 1 

xe[-b,b] 



is given by the polynomials P*_i{x) = 0, P*{x) = j;Un{f) with optimum value 2'^'^b 2".+2_ 
If b > \/2 the maximum in (4.3) subject to (4.4) is attained for the polynomials 



Un-2{ 



X, 



with maximum value (2/6)^*^"" ^\b'^ — 1) 



Appendix. (Proof of Theorem 2.1) The proof of Theorem 2.1 follows from a standard 
result in the theory of optimal design in mathematical statistics (see Pukelsheim (1993)). 
To be precise let / = {n, . . . ,im}, i = "^ + XljLi fji^) — O-i^^ ■ ■ ■■> ^^)' (where ' denotes 
transposition) and define for a probability measure ^ on the interval [—6, 6] 



M. 



i(0 = / fj{x)f,{x)'dax) e M^^^'^^^^^^^ {jel) 
J-b 



which is called moment matrix in the theory of optimal design. In the following we will 
collect all matrices Mj^(^), . . . , Mj^(^) in one big matrix 



M(0 

and define two matrices by 



K = 



N = 



\ 



EM' 
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where Cj = (0, . . . , 0, 1)' e M^'^^ is the {j + l)th unit vector {j e /), Ni. are nonegative 
{ij + 1) X {ij + 1) matrices (i.e. N^. > 0) and all other entries in these matrices are 0. 
Defining $_oo(^) = Amin(^) where A e M'^^'^, A>0 and Ainin(^) denotes the minimum 
eigenvalue of A we obtain for the polar function of $_oo (see Pukelsheim (1993), p. 149) 
^'^^{A) = trace{A). By the duality theorem on page 172 in the same reference it now 
follows that (note that kj{^) = [e^M~^(^)ej]-^) 

maxmin{/cj(0 I i e /} = m.aj<.<^_^{{K'M-\^)K)-^) 

= m.in^[^°^^{K'NK)]-^ \ N G iR*^*, > 0, trace{M{^)N) < 1 G s} 
= min|(^e^.iV,ej)-i | Nj e iR(i+i)x(i+i), Nj > Vj G /, 

= min{(^(e;.a,)2)-i | a,- G 1R'+' Vj G /, 5](/,(x)'a,)2 < 1 Vx G [-6,6]} 
jei jei 

= min{(5^m|(P,))-^ | (P,.),^/ e J^/}- 
In order to go from the third to the fourth line in (Al) we have used that 

Y,trace{M,{ON,) = ^ f f,{x)'N,f,{x)d^{x) < 1 V^gS 
jei jei''-^ 

is equivalent to the inequality 

(A2) 5]/,(a;)'iV,/,(a;) < 1 Vo; G [-6, 6] 

jei 

and the fact that the minimum value does not change if the matrices Nj are replaced by 
matrices of the form aja'j (see the following discussion). This proves the first part of the 
Theorem. For the second part we discuss equality in (Al) that is equality in the duality 
theorem in Pukelsheim (1993) (p. 171,172) and obtain 

(^3) Y.^Tace{Mj{e)Nj) = 1 

jei 

e -e'-N- 

r ^ e'-N -e- 
(A5) mini(e'M-\r)e,)"M Ve'iV.e, = V = 1. 
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Observing that k~^($,*) = ejM~^{^*)ej (j = 1, . . . , n) we obtain by straightforward calcu- 
lation as a solution of (A3) and (A4) A^^ = ajajaj where aj = ■\/kj(^*) M~ (C* ) {j £ I)-, 
Oij > (because Nj > 0) and X^je/ = 1- Finally it follows from (A5) that aj = when- 
ever j ^ M{C)- By Corollary 2.3 in Dette (1994b) the polynomials (a;,^*) = a'Jiix) 
are orthonormal with respect to the measure d^* (x) which yield for the monic orthogonal 
polynomials Pi{x, ^*) = ■\/ki{i*)a[fi{x) (/ = 1, . . . , n). Consequently a solution of the right 
hand side of (Al) is given by { a/ ocj/ kj {^*)Pj {x, ^*) } where Pj{x,^*) is the jth. monic 
orthogonal polynomial with respect to the measure d^*{x) and the aj have to satisfy 

j&i jei 
for all X e [—6, b]. This completes the proof of Theorem 2.1. ■ 
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